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Abstract 

We present an algorithm for effectively generating binary sequences 
which would be rated by people as highly likely to have been generated 
by a random process, such as flipping a fair coin. 

1 Introduction 

This paper is a first step in modelling mathematical objects showing "sub- 
jective randomness" , or what people believe to be random. Although there is 
no rigorous characterization of what subjective randomness might be, it has 
become clear through experimentation that is quite different from stochastic 
randomness. A classic example which illustrates this difference is the fol- 
lowing: when asked which of the following sequences is most likely be to 
produced by flipping a fair coin 20 times, 

ooooooxxxxoooxxooooo 
ooxoxooxxooxoxxxooxo 

most people will answer "the second sequence" even though each sequence 
has been produced by a random generator. 

Until now, subjective randomness has mainly been the study of psychol- 
ogists, cognitive scientists and artist^. However, in today's age, where com- 
puter software is an integral part of everyday life, it is a natural problem 
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^see Twyla Tharp's "Medley" or any of Jackson Pollock's drip paintings 
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to ask how one can present what people would accept as "random". Ex- 
amples of this interest is the popularity of the various "random" playlist 
shuffling software on the market and the appearance of "randomness" in de- 
sign (screensavers, tiling, etc.). In fact, this paper grew from an attempt to 
create exercise drill software for students. The goal was to generate images 
which would be simultaneously unpredictable and yet, in some way, bal- 
anced. We hope to be able to describe a good model for subjectively random 
two-dimensional objects sometime in the future. For now we present one 
for which there already exists a substantial body of research: subjectively 
random binary sequences. 

Most of the research on subjective randomness seeks to understand what 
exactly are the differences between subjective randomness and stochastic 
randomness and to understand why this is so. Experiments usually are of 
two types: "production" where subjects are asked to produce examples of 
what they consider to be random and "judgment" where subjects are asked 
to identify or rate objects based on how likely they would have been produced 
by some random process. From these experiments, a few traits of subjective 
randomness have been pinpointed. 

One is local representativeness the fact that people feel that a small 
sample should reflect properties of the population as a whole [KT], [Rl], 
[TK] . In terms of binary sequences, this translates to concrete restrictions on 
the probability of alternation and the relative number of X's and O's in any 
given subsequence. 

The second trait is high subjective complexity or effort needed to en- 
code the data [C], [F2], [L3]. In the same way that algorithmic (Kolmogorov) 
complexity is measured as the inability of a computer to encode the sequence 
in less bits than the length of the sequence [LV] , so can perceived randomness 
be measured as the inability of the human mind to memorize the sequence 
in significantly less steps than there are sequence elements. 

The third trait concerns symmetry recognition. People are less likely 
to rate a sequence as random if they recognize certain symmetries in them 
[F3], [LO], [S], [R3], [T]. Which symmetries are recognized depends on whether 
the sequence elements are presented simultaneously or one bit at at time. 

In studying those sequences which are subject to the above constraints, 
it becomes apparent that subjectively random binary sequences are anything 
but random in the sense that they have properties which fall far from the 
norm for Bernoulli trials [LV]. This is also the reason that they could not 
be the output of any pseudo-random generator [K], [LI], [L2]; for one, these 
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sequences only have very short runs of X's and O's. Although the locally 
even distribution of X's and O's in subjectively random sequences resembles 
that of low discrepancy, or quasi-random, sequences, they are not quasi- 
random because the alternation rate of X's and O's is too high from normal 
[CG]. The structure of subjectively random sequences is quite rigid and these 
sequences constitute a very small subset of all binary sequences. In fact, this 
set is so small (less than 10% of all sequences of length 20 or more bits) 
that it makes more sense to use a creation method than some brute force 
method using a random generator. We use all of the above conditions in 
creating a simple algorithm which generates (arbitrarily long) subjectively 
random binary sequences. We concentrate on those sequences which would 
be presented one bit at a time, such as in a ticker-tape or in the case when 
someone is actively flipping a coin. 

Our algorithm depends on a function which measures subjective random- 
ness. This function is itself a variation on two ways of measuring subjective 
randomness, one developed by Falk and Konold [FK], the other developed 
by Griffiths and Tenenbaum [GT2] but is more efficient in measuring sub- 
jective randomness on 8-bit sequences. The sequence is created one bit at 
a time; at each step one chooses the element which will produce the desired 
randomness rating on the last 8 bits of the sequence. Producing this se- 
quence is equivalent to moving along paths in a certain associated digraph. 
By traveling within a strongly connected digraph associated to sufficiently 
random 8-bit sequences, one can produce arbitrarily long binary sequences 
that consistently look "random enough" . 

In the first few sections of the paper we give brief accounts of the re- 
search results which will be considered in creating our subjectively random 
sequences. These sections restrict themselves to describing specific results 
and are not surveys of the substantial body of research in subjective random- 
ness done by psychologists and cognitive scientists. For that, we recommend 
articles by Bar-Hillel and Wagenaar [BW], [W], Falk and Konold [FK] and 
by Nickerson [N]. 

We thank T. Griffiths for sharing data and answering questions and the 
referee for many useful comments and suggestions. 
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2 Local Representativeness 



In their studies of how people perceived randomness, Kahneman and Tversky 
argued that, generally, people believe that small samples should reflect the 
properties of the population as a whole. In probability theory, the Law of 
Large Numbers states that large samples tend to have the same properties 
as the population as a whole. Kahneman and Tversky coined the term "The 
Law of Small Numbers" to describe people's belief that this holds even for 
small samples [KT]. When small samples don't behave expectedly, people 
believe that there must be another reason behind the data. For example, 
gamblers attributed "luck" (good or bad) for streaks in outcomes [WK] . 

In the case of binary sequences the Law of Small Numbers presents itself 
in the following way: people believe that in even short random sequences 
there must be roughly the same number of X's as there are O's with some 
irregularity in the order of their appearance. In other words, neither pure 
runs, such as XXXXXX or 000000, nor alternating runs, such as XOX- 
0X0X0 or 0X0X0X0, should be too long. When considering long binary 
sequences, these same features should be present in short subsequences, that 
is, in people's windows of observation. These windows are believed to be of 
variable length of approximately 7 bits [KG] . 

These conditions account for a high alternation rate in subjectively ran- 
dom binary sequences. Truly random sequences tend to have an alternation 
rate of 0.5, but studies consistently show that subjectively random sequences 
have a probability of alternation of 0.6 - 0.7 [BW], [Fl], [FK]. One way of 
understanding why this could be so is that, since there should consistently 
be about as many X's and there are O's, if there is a run of X's. then the run 
shouldn't be too long. This forces the probability of alternation to be higher 
than average. On the flip side, the restriction on the length of alternating 
runs forces the probabihty of alternation to not become too high. 

3 Subjective Complexity 

In [FK] , Falk and Konold flnd that people tend to view a sequence as being 
more random if it was harder to mentally encode it. For example, the se- 
quence XOXOXOXOXO could be described as "five times XO" . However, the 
sequence XOOXXOXXOO cannot be described so concisely: "first an X, then 
two O's, then two X's, an 0, then two X's, then two O's" . In order to ob- 
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jectively measure this effort, Falk and Konold define a "difficulty predictor" 
dp. Let a; be a sequence. Tlien 

dp(x) = tfie number of pure runs + 2 * tfie number of alternating runs in x 

The idea behind this measure is that one uses shortcuts such as "four X's" or 
"XO five times" to describe the sequences. The difficulty predictor measures 
how many different commands one would have to give, taking into account 
the greater effort one needs to check the length of alternating runs over pure 
runs (hence the factor 2). If there arc multiple ways of describing the sequence 
X as concatenation of runs, then dp(a;) is defined to be the minimum of the 
resulting numbers. 

Example: Consider x = XXXOXOXOOO. Then we can view x as 

XXX 0X0X000 or XXXOXOXOOO or XXXOXOXOOO 



The first way rates x's difficulty as 6, the second two rate it as 4. So dp(a;) = 
4. 

In trials where subjects were asked to memorize and copy various se- 
quences of length 21, Falk and Konold found that their difficulty predictor 
was a good correlator for the randomness rating that was given by the sub- 
ject and concluded that it was a good measure of the sequence's perceived 
randomness [FK]. 

Consistent with previous studies, Falk and Konold also found that those 
sequences rated as most random tended to have a higher than average prob- 
ability of alternation. It should be noted that this is, mathematically, not a 
coincidence. Requiring a maximal value for dp is a stronger condition than 
simply requiring that the probability of alternation be 0.6-0.7 and that the 
imbalance (defined as | sequence length/2 — #X's|) be minimal. 

In a series of papers [GT1],[GT2],[GT3], Griffiths and Tenenbaum extend 
the results of Falk and Konold with their own measure of subjective com- 
plexity. They propose that, when presented with a binary sequence x, people 
assess the probability that x is being produced by a random process as op- 
posed to being produced by some other (regular) process. Their measure 
for subjective complexity is then P(x| regular), (where x is considered more 
random, the smaller P(,T|rcgular) is). 

In experiments where they ask subjects to order 8-bit sequences with re- 
spect to how random they believe they are, Griffiths and Tenenbaum first 
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show that a good model for calculating P(x|regular) is a finite state automa- 
ton in the form of a certain hidden Markov model (HMM)[GT2]. Conceptu- 
ally, the model works as follows: As someone reads a sequence x, they will 
consider each element, or X, as possibly being part of one of several motifs. 
These motifs are of varying length and the probabihty that or X will be 
considered as being part of a specific motif will be a function of the motif 
length and whether one is changing from one motif to another. 

Example: In the smallest HMM model they consider [GT2], the 6 state 
model, there are the following states which produce an as observed occur- 
rence: (coming from the motif 0), (coming from the motif OX) and 
(coming from the motif XO). Similarly, there are 3 states which produce X. 
The probability of remaining in a motif is denoted by S, the probability of 
changing to (or starting at) a motif of length k is equal to C ■ a'^ (for some 
a) where C := {1 — d) / {2a + 2a^) is a normalization constant. 

Let 1 denote the state which produces X (from the length 1 motif X), 2 
denote the state which produces (from the motif 0), 3, resp. 4, denote the 
state which produces X, resp. 0, from the motif XO, and 5, resp. 6, denote 
the state which produces X, resp. 0, from the motif OX. The transition 
matrix giving P(i|j) of going from state i to state j is: 



^(i|j) = 
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In the same spirit as for the difficulty predictor, there are many differ- 
ent sequences of states which can produce the same observed sequence. For 
example, the sequence x — XOXX could be produced by any of many state 
sequences: the first X could come from state 1 or 3, the could come from 
states 2, 6 or from state 4 if the preceding X was from state 3. the sec- 
ond X could come from state 1 or 3 or from state 5 if the preceding 
was from state 6, and the third X could come from state 1 or 3 (if the 
previous X was not also from state 3). For each state sequence z, the prob- 
ability P(x, z) that the observed sequence x was produced from z will be a 
function of S and a. For example P(XOXX, 1211) = (Ca)^ = C^a^ and 
P(XOXX, 3431) = Ca'6'Ca = C'a^d'. The probabihty P(a;|regular) is de- 
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fined as the max^ P{x, z) over all possible state sequences which produce x. 

The 22 state model is the natural extension of the 6 state model to include 
up to length 4 motifs which are not duplications of smaller length motifs (such 
as XOXO or XXXX). 

Both the 6 state and the 22 state models have the advantage over the 
difficulty predictor of being a function of the length of a sequence. For 
certain values of S and a, Griffiths and Tenenbaum show that there is a 
equivalence of the 6 state model and Falk and Konold's difficulty predictor. 
In addition, they find values of 6 and a such that both HMMs modelled 
the trials' subjective randomness results better than the difficulty predictor 
[GT2]. As with the difficulty predictor, being rated maximally random (by 
the 22 state model) is a stronger condition than requiring that the sequence 
has probability of alternation 0.6-0.7 and that the imbalance between O's 
and X's is low. 

4 Symmetry recognition 

Griffiths and Tenenbaum's experiments also test the role of symmetry recog- 
nition in subjective randomness and consider the possibility that subjects 
recognize four types of symmetry: mirror symmetry, where the second half 
is produced by reflection of the flrst half (ex. XXOOOOXX), complement 
symmetry, where the second half is produced by reflection of the flrst half 
and exchanging X and (ex. XOOXOXXO), and duphcation, where the se- 
quence is produced by repeating the first half once (ex. XXOXXXOX) . Their 
"context-sensitive" model for P(a;|regular) considers that sequences be gener- 
ated by any of four methods. The first method is repetition, where sequences 
are produced by the HMM. The next three methods are the symmetry meth- 
ods listed above, where the first half of the sequence is produced by the HMM 
and the second half by the symmetry rule. Then, P(x| regular) will depend 
on the models M hsted above: 

P(x|regular) = maxP(a;, ^|M)P(iV/) 

z,M 

where P{x,z\M) is obtained as above and P{M) is to be determined by 
analysis of hard data. 
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In [GTS] , they conclude that when all elements of a sequence are presented 
simultaneously, their context-sensitive model models subjective randomness 
best with the following parameters: 6 = 0.66, a = 0.12, P(repetition) = 
0.748, P(mirror symmetry) = 0.208, P(complement symmetry) = 0.0005 
and P(duplication) = 0.039. However, when elements of a sequence are 
presented sequentially, people do not recognize mirror or complement sym- 
metry, so the best model has the following parameters 6 = 0.70, a = 0.11, 
P(repetition) = 0.962, and P(duphcation) = 0.038. 

5 Measuring subjective randomness effectively 

Griffiths and Tenenbaum find values for a and 6 in order to fit their model to 
the linear ordering on the sequences which is obtained by the experimental 
data. However, for our problem we do not need the full structure of their 
model; since we are interested in the sequences rated "more random" , we only 
need the linear ordering of the sequences produced by this model. An analysis 
of the HMM model shows that we obtain the same ordering as given by the 22- 
motif HMM by doing the following. We consider a and 6 as formal parameters 
satisfying < a < 5^ < 1. The normalizer C can be set to C = 1. This 
abstract version of the HMM will assign to each sequence an expression of 
the form Let x and y be two sequences such that P(x|regular) = a^^6^^ 
and P(j/|regular) = a^^5^^ (where the probability P(- [regular) is determined 
by the HMM). Then, x is subjectively more random than y if ^2 — "^i > and 

Ji - 32 < 4(^2 - tS 

With regard to the repetition and duplication models, we need to set 
a6'^ < P(duplication)/P(repetition) < aS^. To obtain the same ordering as 
in [GT3], it suffices to set P(repetition) = 1 and P (duplication) = a6^-^. 
(In our next paper, we will consider the simultaneous case which includes all 
four models). 

Table 1 gives the partial ordering of all 128 sequences starting with 
from most to least subjectively random.The first column gives an example 
sequence in full and the second column gives the list of equally rated se- 
quences in the form of a base 10 number. The third column gives the "finite 
state" P(x|regular) as determined by the 22 motif HMM. The fourth column 

^This does not give a partial ordering on all monomials in a and 6 but simply on those 
produced by this model. 
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gives the "context-sensitive" P(x|regular) which also takes into consideration 
dupUcation symmetry. 

This abstract version of the 22-motif HMM allows us to define a more 
efficient variant dp_)_, which will give the same result as the 22-motif HMM; 
if the HMM assigns a sequence x the value a'^S^ then this predictor will 
assign it the value where i represents the sum of the lengths of the 

contributing motifs and the j represents the length of the sequence minus 
the number of contributing motifs. However, it will "read" the sequence 
in the same spirit as Falk and Konold's difficulty predictor. In addition 
to length 1 and 2 motifs, dp+ considers length 3 and 4 motifs and records 
over how many bits the motif appears. For example, viewing the sequence 
XOOXOOXX as two instances of XOO and two instances of X gives a value of 
[3 + 1, 8 — 2] = [4, 6] Here, we allow incomplete repeats to be included into the 
run. For example, the sequence XOXOXOX would be given a value of [2, 6] 
because it is a run of 3 1/2 instances of the length 2 motif XO. The rating 
given to a sequence x would be the smallest obtained by considering 
X as various concatenations of motifs. The part of dp+ consisting of the 
initial conditions and Step 1 is equivalent to the abstract 6 state HMM (with 
conditions C = 1 and < a < 5^ < 1). The reason that dp+ is more efficient 
(for this problem) than the HMM is that it identifies those subsequences 
(and motifs) which will contribute to the value of dp+. Hence, over 8-bit 
sequences, it is recursive with small depth. For any object c described using 
bracket notation [■■■], we will write c[i] to denote its ith component or entry. 
The algorithm for dp+ is as follows: 

Algorithm to compute dp+: 
INPUT: X = XiX2- ■ -Xn- 

# Special cases: 

IF X is empty THEN OUTPUT [0,0] 

ELSE IF X is a pure run THEN OUTPUT [l,n-l] 

ELSE IF X is an alternating run and n > 3 THEN OUTPUT [2,n-l] 

# consider only length 1 and 2 motifs 

ELSE SET j := 1 

# find the largest j >l such that Xi---Xj consists of j repeats 
of Xi . 

WHILE j <n AND xj = Xi DO 

INCREMENT j BY 1 
END DO 
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IF j > 1 THEN 

si := X1X2 ■ ■ ■ and s2 :— Xj • ■ • Xn- 
ELSE SET k -.= 1 

# find the largest k >1 such that X1X2 ■ ■ ■ X2k is k repeats of 
X1X2 . 

WHILE k <n AND X2k-i = Xi AND X2k = X2 DO 

INCREMENT ^ BY 1 
END DO 

IF A; = 1 THEN 

si := Xi and s2 :— X2 • • • Xn ■ 
ELSE si := X1X2 ■ ■ ■ X2k and s2 := X2k+i ■ ■ - Xn. 
END IF 
END IF 
END IF 

SET dp+'(a;) = dp+(sl) + dp+(s2) . 

# consider length 3 motifs 

SET Dset = {dp+'(x)} # collects possibly smaller difficulty values 
IF dp+'(x)[l] > 4 THEN 

FOR i FROM 1 TO n - 5 DO 

# determine the largest j > i + 4: such that Xi...Xj is a 
repeat of XiXi+iXi+2 ■ 

SET j := i + 3 

WHILE j <n AND Xj = Xi+(^(j^i) mod 3) DO 

INCREMENT j BY 1 
END DO 

IF J - i > 4 THEN 

LET Dset := Dset UNION {dp+(j;i • • • x^.i) + [3, j - i - 2] + 

dp+(xj+i • • - Xn)} 

END IF 
END LOOP 
END IF 

SET dp+"(a;) = min^gDset 

#consider length 4 motifs 

SET Dset = {dp+"(x)} # collects possibly smaller difficulty values 
IF dp+"(a;)[l] > 5 THEN 

FOR i FROM 1 TO n - 6 DO 
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# determine the largest j > i + 5 such that 
repeat of XjXj+iXi+S'^j+s • 



SET j := i + 4 

WHILE j <n AND Xj = Xi+(^(j_i) mod 4) DO 

INCREMENT j BY 1 
END DO 

IF j -i>7 THEN 

LET Dset:=Dset UNION {dp+{xi- ■ ■Xi-i) + [AJ -i-2] + 



dp+{xj+i ■■■Xn)} 

END IF 
END LOOP 
END IF 

OUTPUT: min^eDset'y 
END IF 

Using MAPLE, we found that, dp_|_ took significantly less time and mem- 
ory than the (polynomial time) Viterbi algorithm (see [R2] for a good tutorial 
on the Viterbi algorithm) with the 22 motif HMM. On average, the Viterbi 
algorithm takes 47 times the time and 400 times the space as dp+ to cal- 
culate the difficulty of an 8-bit sequence. We also note that this algorithm 
duplicates the results of the 22 motif model only for sequences of 8 bits or 
less. It becomes less accurate as the length of the sequence increases. 

Taking into account the possibility of duplication (x = X1X2 ■ ■ ■ Xn/2XiX2 ■ ■ ■ a;„/2), 
we obtain the subjective randomness rating sy{x) for any 8-bit sequence x: 



min(dp+(a;), dp+(a;iX2 • • • 2^/1/2) + [1, 3.5]) if x is a perfect duplication. 



This gives the same partial ordering on 8- bit sequences as the "context- 
sensitive" model described in Section 3. 

6 Sequence Creation 

We can now produce arbitrarily long subjectively random binary sequences. 
In this paper, we concentrate on those sequences which would be presented 
sequentially (one bit at a time) instead of simultaneously. According to [GT3] 
(see above), in addition to the rating given to the sequence by the HMM, the 
only symmetry that needs to be considered then is duplication. 




otherwise 
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The idea behind producing this sequence is the obvious one: pick the next 
element in a sequence (0 or X) to be the one which gives a sufficiently high 
rating of subjective randomness on the last 8 bits of the resulting sequence. 
The rationale behind this is that since people use small (6-8 bits) windows 
of observation when analyzing a sequence, it suffixes that people feel at any 
one time that their window looks "random" enough. 

Ideally, one would like to create sequences which would be of maximal sr 
over every small subsequence. However, such sequences simply do not exist. 
Even restricting oneself to sequences for which every 8-bit subsequence x is 
in the top 20 % (sr(a;)[l] = 5) becomes too deterministic; there are only 2 
such (arbitrarily long) sequences and each has a period of 6 bits. 

For any 8-bit sequence x = Xi---Xs, let xO := a;2---a;80 and xX : = 
X2 - ■ ■ XgX. (If X is the last 8 bits of the sequence that we arc creating, then 
xO or xX will be the last 8 bits of the sequence to which we've added one more 
element). To any subset -S" of 8- bit sequences, we can associate a directed 
graph, or digraph, G :— G{S) in the following way: the vertices of G are 
indexed by the elements of S. The directed edges are defined as follows: 
X ^ y if y = xO E S or y = xX G S. 

Recall that a digraph G is connected, resp. strongly connected, if, for every 
two vertices x y inG, there is a nondirected, resp. directed, path from x to 
y. A subgraph C of G is a connected, resp. strongly connected, component 
if it is maximal for this property. A component is trivial if it consists of 
a single vertex. For our problem, we are particularly interested when G is 
strongly connected. In this case, we can form arbitrarily long sequences x by 
choosing an equally long directed path in G. The strong connectivity implies 
that there are no sinks and thus guarantees the existence of such a path. 

Set S{[m,n]) := {x\x 8-bit , sr(a;) > [m, n]} for [m, n] any of the possible 
sr values [1, 7], [2, 6], . . .,[5, 6]. The following theorem is proved by brute force 
(Maple). 

Theorem 11) For [m,n] < [5,3], G{S{[m,n])) has exactly one non-trivial 
strongly connected component C{[m,n]) and C([5,3]) has exactly two non- 
trivial strongly connected components Ci([5, 3]), C2([5, 3]). These components 
form a sequence of nested digraphs: 

C{[1, 7]) D G{[2, 6]) D • • • D C([4, 6]) D Q([5, 3]), ^ = 1, 2. 

G{S{[m,n\)) has no non-trivial strongly connected components for [m,n\ > 
[5,3]. 
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2) For [m, n] < [4,4.5]; the C([m, n]) are also connected components of 
G{S{[mM))- 

Table 2 gives detailed information about the C([m, n]). A highly efficient 
way to produce sequences would be to use an incidence matrix (or table) M 
for the graph C([m,n]), indexed by its vertices, with entries M^^y equalling 
the probability of traveling from x to y. A sequence would then be created 
by randomly choosing a vertex of C([m,n]) (which gives the first 8 bits of 
the sequence) and then moving to each consecutive vertex/bit as dictated by 
M. This method is fast (a million bits in 347 seconds using Maple) and it 
produces every possible sequence. However it is rigid; if one wants to tinker 
with the set from which these subsequences come (by making it larger), it 
would be preferable to use a method that it based on built-in bounds. The 
success of such a method would depend on being always able to remain within 
a subgraph which is connected and strongly connected. The subgraph being 
connected allows one to use the algorithm's simple bounds condition and still 
remain in the graph. The strong connectedness prevents one from travelling 
to a sink. The algorithm is then straightforward: 

Sequence Creation Algorithm: Input an initial x = xi - ■ ■ from C([m, n]) 
and N, the desired sequence length. For i from 9 to A'^ do: 

• if sr(xX) > [m, n\ AND sr(xO) < [m, n] then set Xi — X 

• if sr(xO) > [m, n\ AND sr(a;X) < [m, n] then set Xi — 

• otherwise choose = or X at random. 

From the above Theorem, we see that we can use this algorithm for paths 
within C([m,r2]), [m,n] < [4,4.5]. (Otherwise, we need more complicated 
conditions to ensure that we remain within C([m,n]).) We argue that, for 
very long sequences, the best choice is the 164-element set C([4, 4]). This 
might at first seem to be too large of a set. However, any 20-bit sequence 
produced from C([4, 4]) is in the top 9% (of subjective randomness) of all 20- 
bit (or more) sequences. C([4, 4]) also has a large cycle basis which allows for 
a great variety of sequences (see [GLS] for results concerning cycle bases for 
digraphs). Since the proportion of 7^ cycles/ # vertices is significantly larger 
for C([m, n]) (see Table 2) than it is for any C{[m,n]) for [m, n] > [4,4] 
it makes sense to choose C([4, 4]) to make these sequences. Almost any 
7-bit sequence can serve as the first seven bits of an element in C([4, 4]). 
The exceptions are 1) any sequence with a streak of length 5 or more, 2) 
those 7-bit sequences with sr[l] < 2, and 3) the sequences 000X000 and 
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XXXOXXX. With such an initial sequence and bounds [m, n] = [4, 4] in the 
above algorithm, we produce sequences always within C([4,4]). 

On average, such subjectively random sequences coming from C([4,4]) 
have equal numbers of X's and O's and an alternation rate of .58. Examples 

of such STibjcctivcly random sequences coming from arc: 




which is consistently more "balanced" than what is produced by a random 
generator: 




7 Conclusion 

There are many questions that one might ask concerning the sequences pre- 
sented here as subjectively random. For example, wouldn't people notice 
periodicity with motifs of length 5 or more? Would a HMM with more motifs 
give a more accurate model for subjective randomness? Would it be better 
to use larger "windows" when evaluating a sequence's subjective random- 
ness? How do the conditions for subjective randomness change as sequences 
become longer? Would these sequences be, in the long run, too regular (and 
thereby too predictable)? 

These and similar questions bring us to the main problem for this pro- 
gram, that is, the lack of hard results which are geared to this type of mod- 
eling. Simply put, most research done on subjective randomness seeks to 
answer questions which have little to do with producing subjectively random 
objects. The algorithm presented here is based on what information there is. 
However, more research would have to be done to determine how to fine-tune 
it, if necessary. 
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sample sequence 


all sequences with same random- 


finite 


context- 




ness rating 


state 


sensitive 


[0. 1. 0. 0. 1. 1. 0. 1] 






(\-'()^'' 


[0, 1, 1, 0, 1, 0, 0, 1] 


105 


a^5^ 


a^5^ 


[0, 1. 1. 0. 1. 0. 1. 1] 


41, 69, 74, 82, 89, 93, 101, 107, 


a^S^ 


a^6^ 


[0, 1, 1, 1, 0, 1, 1, 0] 


38, 44, 46, 50, 52, 66, 70, 76, 78, 
98, 100, 110, 114, 116, 118 


aH^ 


aH^ 


[0, 1, 1, 0, 1, 1, 0, 0] 


18, 22, 37, 45, 54, 72, 75, 90, 91, 
104, 108 






[0, 1, 1, 1, 1, 0, 1, 0] 


20, 26, 40, 43, 53, 58, 81, 83, 86, 
88, 92, 94, 106, 117, 122 






[0, 1, 1, 0, 0, 1, 1, 0] 


34, 68, 102 






[0, 1, 1, 1, 1, 1, 0, 1] 


9, 11, 13, 19, 23, 25, 27, 29, 33, 
35, 39, 47, 49, 55, 57, 59, 61, 65, 
67, 71, 79, 97, 99, 103, 111, 113, 
115, 121, 123, 125 






[0, 1, 1, 0, 1, 1, 0, 1] 


36, 73, 109 






[0, 1, 0, 1, 1, 1, 1, 1] 


5, 10, 21, 42, 80, 84, 87, 95 






[0, 1, 1, 1, 0, 1, 1, 1] 


17, 51, 119 






[0, 1, 1, 1, 0, 0, 0, 0] 


2, 4, 6, 8, 12, 14, 16, 24, 28, 30, 
32, 48, 56, 60, 62, 64, 96, 112, 120, 
124, 126 






[0, 1, 0, 1, 0, 1, 0, 1] 


85 






[0, 1, 1, 1, 1, 1, 1, 1] 


1, 3, 7, 15, 31, 63, 127 






[0, 0, 0, 0, 0, 0, 0, 0] 










Table 1: The linear ordering of 8-bit binary sequences given by both "finite 
state" and "context-sensitive" models. Sequences Xi - • -Xn are written in 
base 10: 2"'-^xi H h 2x„_i + x^ 
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r -1 

[m, n\ 


G([m, n\) 


num. 


num. 


cycle basis 






vertices 


arcs 


cardinality 


[5,3] 


(7i([5, 3]) = {44, 89, 178, 101, 203, 150} 
C^2([5, 3]) = {77, 154, 52, 105, 211, 166} 


6 
6 


6 
6 


1 
1 


r A r-l 

[4, 6] 


Ci([5,3]) U C2([5,3]) U 

{22, 38, 41, 45, 46, 54, 69, 74, 75, 

Qo on OQ inn in/i inT i hq ion 
oz, yu, yo, iUU, iU4, iU / , iUo, lib, ioy, 

147, 148, 151, 155, 162, 165, 173, 180, 

181, 186, 201, 209, 210, 214, 217, 233} 


46 


58 


13 


[4,5] 


5([4, 5]) \ {18, 37, 50, 66, 72, 76, 91, 94, 
110, 118, 122, 133, 137, 145, 161, 164, 179, 

183,189,205,218,237} 


80 


120 


41 


[4, 4.5] 


5([4, 4.5]) \ {66, 94, 122, 133, 161, 189} 


102 


158 


57 


[4,4] 


5([4,4])\{190, 125,65, 130} 


164 


280 


117 


[3,7] 


5([3,7])\{190, 125,65, 130} 


170 


298 


129 


[3,6] 


5([3,6]) 


190 


342 


153 


[3,5.5] 


5([3,5.5]) 


196 


360 


165 


[3,5] 


5([3,5]) 


238 


462 


225 


[2,7] 


5([2,7]) 


240 


467 


228 


[2,6] 


5([2,6]) 


254 


505 


252 


[1,7] 


5'([1, 7]) = all 8-bit sequences 


256 


512 


254 



Table 2: Description of the strongly connected components C([m, nj) for each 
[m, n] for which C([m,r2]) is non-trivial. (Sequences X\ - ■ -x^ are written in 
base 10: T^'^xx H h 2a;„_i + a;„) 
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